Approximate String Matching in R


1d <
  • Mislav Šagovac

    Mislav Šagovac   I have used a fuzzy matching package and it was very slow on big datasets. For me it was better to use the stringdist package and write my own simple function.
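
A minimal sketch of what such a "simple function" might look like, assuming the stringdist package is installed. The helper name `closest_match`, the default method `"jw"`, and the `max_dist` cutoff of 0.2 are illustrative assumptions, not something taken from the comment above.

```r
library(stringdist)

# Hypothetical helper: for each string in x, return the closest entry of
# `table`, or NA when even the best candidate is further away than max_dist.
closest_match <- function(x, table, method = "jw", max_dist = 0.2) {
  d <- stringdistmatrix(x, table, method = method)  # rows: x, columns: table
  best <- apply(d, 1, which.min)                    # index of nearest candidate
  best_dist <- d[cbind(seq_along(x), best)]         # distance to that candidate
  data.frame(
    input    = x,
    match    = ifelse(best_dist <= max_dist, table[best], NA_character_),
    distance = best_dist,
    stringsAsFactors = FALSE
  )
}

# Made-up example data
res <- closest_match(c("Jon Smith", "Micheal Jordan", "zzzz"),
                     c("John Smith", "Michael Jordan"))
print(res)
```

The full distance matrix grows with length(x) * length(table), so for large inputs it may be worth trying stringdist's `amatch()` with a `maxDist` argument instead, or splitting the lookup table into blocks first.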

    5h <
  • Mislav Šagovac

    Mislav Šagovac   It would also be good to use a set of string distances rather than only one. The best approach would be a machine learning method that learns which match is right.
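
One way to read "a set of string distances" is to compute several distances per candidate pair and treat them as features. The sketch below assumes the stringdist package. The helper name `distance_features`, the chosen methods, and the example pairs are illustrative assumptions.

```r
library(stringdist)

# Distance methods to combine; all are valid `method` values in stringdist.
methods <- c("osa", "lv", "jw", "cosine", "jaccard")

# Hypothetical helper: one row per pair (a[i], b[i]), one column per method.
# q is only used by the q-gram based methods ("cosine", "jaccard").
distance_features <- function(a, b, methods, q = 2) {
  cols <- lapply(setNames(methods, methods),
                 function(m) stringdist(a, b, method = m, q = q))
  do.call(cbind, cols)
}

# Made-up example pairs
a <- c("Jon Smith", "Micheal Jordan", "apple")
b <- c("John Smith", "Michael Jordan", "orange")

feats <- distance_features(a, b, methods)
print(round(feats, 3))
```

The second pair shows why one distance alone can mislead: the swapped letters count as one edit under "osa" but two under "lv". With a set of hand-labelled pairs (match or no match), columns like these could serve as predictors in a classifier, for example `glm()` with `family = binomial`.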

    4h <
  • Abdelouahed Ben Mhamed

    Abdelouahed Ben Mhamed   You can use the function pmatch().
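
A short base R illustration of how `pmatch()` behaves. Note that it does partial matching on the beginning of a string, so it helps with abbreviations rather than typos. The `choices` vector is made up for the example, and `adist()` is added only as a contrast.

```r
choices <- c("mean", "median", "mode")

pmatch("mea", choices)   # 1: unique partial match on the prefix
pmatch("mo", choices)    # 3
pmatch("me", choices)    # NA: ambiguous, matches both "mean" and "median"
pmatch("maen", choices)  # NA: a typo is not a prefix of any choice

# For typos, an edit distance such as base R's adist() is closer to what
# approximate matching usually means.
d <- adist("maen", choices)
nearest <- choices[which.min(d)]  # the choice with the smallest edit distance
print(nearest)
```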

    4h <
  • Brendan Morse

    Brendan Morse   This paper by Gaston Sanchez is very comprehensive and easy to follow. 
    http://gastonsanchez.com/Handling_and_Processing_Strings_in_R.pdf

    3h <
  • Gabriel Gomes

    Gabriel Gomes   I use the stringdist package. It offers several different string distance algorithms, so I think it is the best.
